initial point
We thank all the reviewers for their insightful and encouraging feedback. Due to the discrete nature of COMBO's search space, the implementation details differ slightly. In contrast, in COMBO's combinatorial graphs, we have spray vertices. As R2 suggested, this heuristic promotes exploitation. Using random vertices for exploration is similar to Spearmint.
Appendix
Bound of GEM gradient estimation error (Section 3.2)
We show a general proposition in VI (or other measure approximation methods). However, it is not easy to model an arbitrary prior distribution with effective and efficient Bayesian inference. For a single cluster of tasks, we show empirical evidence in Appendix C that such a distribution exists. In this work we focus on the uni-modal situation and leave the multi-modal situation to future work.
B.3. Coordinate descent (Section 3.2)
Following the ELBO property mentioned in Section 3.2 we have max Line 4 of Subroutine GEM-BML and Line 10 of Algorithm 1.
B.4. Recasting related works to our framework (Section 4)
For simplicity, we first set up some notation as follows: sg: stop gradient D This tensor building and backProps procedure has several drawbacks.